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(1) GENERAL INFORMATION: 

(i) APPLICANT: Genentech, Inc., Hsei, Vanessa 
Koumenis, Iphigenia 
Leong , Steven R . 
Presta, Leonard G . 
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(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 650/225-5530 

(B) TELEFAX: 650/952-9881 
(2) INFORMATION FOR SEQ ID NO : 1 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 



CAGTCCAACT GTTCAGGACG CC 2 2 



(2) INFORMATION FOR SEQ ID NO : 2 : 



(i) SEQUENCE CHARACTERISTICS: 
\' (A) LENGTH: 22 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 



GTGCTGCTCA TGCTGTAGGT GC 2 2 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 



GAAGTTGATG TCTTGTGAGT GGC 2 3 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
{ B } TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

( D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 



GCATCCTAGA GTCACCGAGG AGCC 2 4 



(2) INFORMATION FOR SEQ ID NO : 5 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 



CACTGGCTCA GGGAAATAAC CC 2 2 
(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 



GGAGAGCTGG GAAGGTGTGC AC 2 2 
(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 



ACAAACGCGT ACGCTGACAT CGTCATGACC CAGTC 3 5 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 



ACAAACGCGT ACGCTGATAT TGTCATGACT CAGTC 3 5 
(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 



ACAAACGCGT ACGCTGACAT CGTCATGACA CAGTC 3 5 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



GCTCTTCGAA TGGTGGGAAG ATGGATACAG TTGGTGC 3 7 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



CGATGGGCCC GG AT AG AC CG ATGGGGCTGT TGTTTTGGC 3 9 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



CGATGGGCCC GGATAGACTG ATGGGGCTGT CGTTTTGGC 3 9 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
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CGATGGGCCC GGATAGACGG ATGGGGCTGT TGTTTTGGC 3 9 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

CGATGGGCCC GGATAGACAG ATGGGGCTGT TGTTTTGGC 3 9 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CGATGGGCCC GG AT AG AC TG ATGGGGCTGT TGTTTTGGC 3 9 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 369 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

GACATTGTCA TGACACAGTC TCAAAAATTC ATGT C C AC AT CAGTAGGAGA 5 0 

CAGGGTCAGC GTCACCTGCA AGGCCAGTCA GAATGTGGGT ACTAATGTAG 10 0 

CCTGGTATCA ACAGAAACCA GGGCAATCTC CTAAAGCACT GATTTACTCG 150 

TCATCCTACC GGTACAGTGG AGTCCCTGAT CGCTTCACAG GCAGTGGATC 2 00 

TGGGACAGAT TTCACTCTCA CCATCAGCCA TGTGCAGTCT GAAGACTTGG 2 50 

CAGACTATTT CTGTCAGCAA TATAACATCT ATCCTCTCAC GTTCGGTCCT 300 

GGGACCAAGC TGGAGTTGAA ACGGGCTGAT GCTGCACCAC CAACTGTATC 350 
CATCTTCCCA CCATTCGAA 369 
(2) INFORMATION FOR SEQ ID NO: 17: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Asp lie Val Met Thr Gin Ser Gin Lys Phe Met Ser Thr Ser Val 

15 10 15 

Gly Asp Arg Val Ser Val Thr Cys Lys Ala Ser Gin Asn Val Gly 

20 25 30 

Thr Asn Val Ala Trp Tyr Gin Gin Lys Pro Gly Gin Ser Pro Lys 

35 40 45 

Ala Leu lie Tyr Ser Ser Ser Tyr Arg Tyr Ser Gly Val Pro Asp 

50 55 60 

Arg Phe Thr Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr lie 

65 70 75 

Ser His Val Gin Ser Glu Asp Leu Ala Asp Tyr Phe Cys Gin Gin 

80 85 90 

Tyr Asn lie Tyr Pro Leu Thr Phe Gly Pro Gly Thr Lys Leu Glu 

95 100 105 

Leu Lys Arg Ala Asp Ala Ala Pro Pro Thr Val Ser lie Phe Pro 

110 115 120 



Pro Phe Glu 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 417 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



TTCTATTGCT ACAAACGCGT ACGCTGAGGT GCAGCTGGTG GAGTCTGGGG 5 0 

GAGGCTTAGT GCCGCCTGGA GGGTCCCTGA AACTCTCCTG TGCAGCCTCT 10 0 

GGATTCATAT TCAGTAGTTA TGGCATGTCT TGGGTTCGCC AGACTCCAGG 150 

CAAGAGCCTG GAGTTGGTCG CAACCATTAA TAATAATGGT GATAGCACCT 2 00 

ATTATCCAGA CAGTGTGAAG GGCCGATTCA CCATCTCCCG AGACAATGCC 250 

AAGAACACCC TGTACCTGCA AATGAGCAGT CTGAAGTCTG AGGACACAGC 3 00 
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CATGTTTTAC TGTGCAAGAG CCCTCATTAG TTCGGCTACT TGGTTTGGTT 3 50 
ACTGGGGCCA AGGGACTCTG GTCACTGTCT CTGCAGCCAA AACAACAGCC 400 
CCATCTGTCT ATCCGGG 417 
(2) INFORMATION FOR SEQ ID NO : 1 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Glu Val Gin Leu Val Glu Ser Gly Gly Gly Leu Val Pro Pro Gly 
15 10 15 

Gly Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Phe lie Phe Ser 
20 25 30 

Ser Tyr Gly Met Ser Trp Val Arg Gin Thr Pro Gly Lys Ser Leu 
35 40 45 

Glu Leu Val Ala Thr lie Asn Asn Asn Gly Asp Ser Thr Tyr Tyr 
50 55 60 

Pro Asp Ser Val Lys Gly Arg Phe Thr lie Ser Arg Asp Asn Ala 
65 70 75 

Lys Asn Thr Leu Tyr Leu Gin Met Ser Ser Leu Lys Ser Glu Asp 
80 85 90 

Thr Ala Met Phe Tyr Cys Ala Arg Ala Leu lie Ser Ser Ala Thr 
95 100 105 

Trp Phe Gly Tyr Trp Gly Gin Gly Thr Leu Val Thr Val Ser Ala 
110 115 120 

Ala Lys Thr Thr Ala Pro Ser Val Tyr Pro 
125 130 

(2) INFORMATION FOR SEQ ID NO : 2 0 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 0 : 



ACAAACGCGT ACGCTGATAT CGTCATGACA G 31 
(2) INFORMATION FOR SEQ ID NO : 2 1 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 1 : 



GCAGCATCAG CTCTTCGAAG CTCCAGCTTG G 31 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 



CCACTAGTAC GCAAGTTCAC G 21 
(2) INFORMATION FOR SEQ ID NO : 2 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 3 : 



GATGGGCCCT TGGTGGAGGC TGCAGAGACA GTG 3 3 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 714 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



ATGAAGAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 5 0 

TGCTACAAAC GCGTACGCTG ATATCGTCAT GACACAGTCT CAAAAATTCA 10 0 

TGTCCACATC AGTAGGAGAC AGGGTCAGCG TCACCTGCAA GGCCAGTCAG 15 0 

AATGTGGGTA CTAATGTAGC CTGGTATCAA C AGAAAC C AG GGCAATCTCC 200 

TAAAGCACTG ATTTACTCGT CATCCTACCG GTACAGTGGA GTCCCTGATC 250 
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GCTTCACAGG CAGTGGATCT GGGACAGATT TCACTCTCAC CATCAGCCAT 300 
GTGCAGTCTG AAGACTTGGC AGACTATTTC TGTCAGCAAT ATAACATCTA 3 50 
TCCTCTCACG TTCGGTCCTG GGACCAAGCT GGAGCTTCGA AGAGCTGTGG 4 00 
CTGCACCATC TGTCTTCATC TTCGCGCCAT CTGATGAGCA GTTGAAATCT 4 50 
GGAACTGCTT CTGTTGTGTG CCTGCTGAAT AACTTCTATC CCAGAGAGGC 500 
CAAAGTACAG TGGAAGGTGG ATAACGCCCT CCAATCGGGT AACTCCCAGG 550 
AGAGTGTCAC AG AG C AGG AC AGCAAGGACA GCACCTACAG CCTCAGCAGC 600 
ACCCTGACGC TGAGCAAAGC AGACTACGAG AAACACAAAG TCTACGCCTG 6 50 
CGAAGTCACC CATCAGGGCC TGAGCTCGCC CGTCACAAAG AGCTTCAACA 700 
GGGGAGAGTG TTAA 7 14 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 5 : 

Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
15 10 15 

Ser lie Ala Thr Asn Ala Tyr Ala Asp lie Val Met Thr Gin Ser 
20 25 30 

Gin Lys Phe Met Ser Thr Ser Val Gly Asp Arg Val Ser Val Thr 
35 40 45 

Cys Lys Ala Ser Gin Asn Val Gly Thr Asn Val Ala Trp Tyr Gin 
50 55 60 

Gin Lys Pro Gly Gin Ser Pro Lys Ala Leu lie Tyr Ser Ser Ser 
65 70 75 

Tyr Arg Tyr Ser Gly Val Pro Asp Arg Phe Thr Gly Ser Gly Ser 
8 0 8 5 9 0 

Gly Thr Asp Phe Thr Leu Thr lie Ser His Val Gin Ser Glu Asp 
95 100 105 

Leu Ala Asp Tyr Phe Cys Gin Gin Tyr Asn lie Tyr Pro Leu Thr 
110 115 120 

Phe Gly Pro Gly Thr Lys Leu Glu Leu Arg Arg Ala Val Ala Ala 
125 130 135 



Pro Ser Val Phe lie Phe Pro Pro Ser Asp Glu Gin Leu Lys Ser 
140 145 150 

Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg 
155 160 165 

Glu Ala Lys Val Gin Trp Lys Val Asp Asn Ala Leu Gin Ser Gly 
170 175 180 

Asn Ser Gin Glu Ser Val Thr Glu Gin Asp Ser Lys Asp Ser Thr 
185 190 195 

Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu 
200 205 210 

Lys His Lys Val Tyr Ala Cys Glu Val Thr His Gin Gly Leu Ser 
215 220 225 

Ser Pro Val Thr Lys Ser Phe Asn Arg Gly Glu Cys 
230 235 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 756 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



ATGAAAAAGA AT AT CGC ATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 5 0 

TGCTACAAAC GCGTACGCTG AGGTGCAGCT GGTGGAGTCT GGGGGAGGCT 10 0 

TAGTGCCGCC TGGAGGGTCC CTGAAACTCT CCTGTGCAGC CTCTGGATTC 15 0 

ATATTCAGTA GTTATGGCAT GTCTTGGGTT CGCCAGACTC CAGGCAAGAG 200 

CCTGGAGTTG GTCGCAACCA TTAATAATAA TGGTGATAGC ACCTATTATC 2 50 

CAGACAGTGT GAAGGGCCGA TTCACCATCT CCCGAGACAA TGCCAAGAAC 300 

ACCCTGTACC TGCAAATGAG CAGTCTGAAG TCTGAGGACA CAGCCATGTT 3 50 

TTACTGTGCA AGAGCCCTCA TTAGTTCGGC TACTTGGTTT GGTTACTGGG 4 00 

GCCAAGGGAC TCTGGTCACT GTCTCTGCAG CCTCCACCAA GGGCCCATCG 4 50 

GTCTTCCCCC TGGCACCCTC CTCCAAGAGC ACCTCTGGGG GCACAGCGGC 50 0 

CCTGGGCTGC CTGGTCAAGG ACTACTTCCC CGAACCGGTG ACGGTGTCGT 55 0 

GGAACTCAGG CGCCCTGACC AGCGGCGTGC ACACCTTCCC GGCTGTCCTA 600 

CAGTCCTCAG GACTCTACTC CCTCAGCAGC GTGGTGACCG TGCCCTCCAG 6 50 



CAGCTTGGGC ACCCAGACCT ACATCTGCAA CGTGAATCAC AAGCCCAGCA 700 
ACACCAAGGT GGACAAGAAA GTTGAGCCCA AATCTTGTGA CAAAACTCAC 7 50 
ACATGA 756 

(2) INFORMATION FOR SEQ ID NO : 2 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 

15 10 15 

Ser lie Ala Thr Asn Ala Tyr Ala Glu Val Gin Leu Val Glu Ser 

20 25 30 

Gly Gly Gly Leu Val Pro Pro Gly Gly Ser Leu Lys Leu Ser Cys 

35 40 45 

Ala Ala Ser Gly Phe lie Phe Ser Ser Tyr Gly Met Ser Trp Val 

50 55 60 

Arg Gin Thr Pro Gly Lys Ser Leu Glu Leu Val Ala Thr lie Asn 

65 70 75 

Asn Asn Gly Asp Ser Thr Tyr Tyr Pro Asp Ser Val Lys Gly Arg 

80 85 90 

Phe Thr lie Ser Arg Asp Asn Ala Lys Asn Thr Leu Tyr Leu Gin 

95 100 105 

Met Ser Ser Leu Lys Ser Glu Asp Thr Ala Met Phe Tyr Cys Ala 

110 115 120 

Arg Ala Leu lie Ser Ser Ala Thr Trp Phe Gly Tyr Trp Gly Gin 

125 130 135 

Gly Thr Leu Val Thr Val Ser Ala Ala Ser Thr Lys Gly Pro Ser 

140 145 150 

Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr 

155 160 165 

Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val 

170 175 180 

Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr 

185 190 195 

Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser Leu Ser Ser 

200 205 210 



Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gin Thr Tyr lie 
215 220 225 

Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp Lys Lys 
230 235 240 

Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr 
245 250 

(2) INFORMATION FOR SEQ ID NO : 2 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 8 : 



CCAATGCATA CGCTGACATC GTGATGACCC AGACCCC 3 7 
(2) INFORMATION FOR SEQ ID NO : 2 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 9 : 



CCAATGCATA CGCTGATATT GTGATGACTC AGACTCC 3 7 
(2) INFORMATION FOR SEQ ID NO : 3 0 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 0 : 



CCAATGCATA CGCTGACATC GTGATGACAC AG AC AC C 3 7 
(2) INFORMATION FOR SEQ ID NO : 3 1 : 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 1 : 

AGATGTCAAT TGCTCACTGG ATGGTGGGAA GATGG 3 5 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

CAAACGCGTA CGCTGAGATC CAGCTGCAGC AG 3 2 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

CAAACGCGTA CGCTG AG ATT CAGCTCCAGC AG 3 2 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 391 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

GATATCGTGA TGACACAGAC ACCACTCTCC CTGCCTGTCA GTCTTGGAGA 5 0 

TCAGGCCTCC ATCTCTTGCA GATCTAGTCA GAGCCTTGTA CACGGTATTG 10 0 

GAAACACCTA TTTACATTGG TACCTGCAGA AGCCAGGCCA GTCTCCAAAG 150 

CTCCTGATCT ACAAAGTTTC CAACCGATTT TCTGGGGTCC CAGACAGGTT 200 

CAGTGGCAGT GGATCAGGGA CAGATTTCAC ACTCAGGATC AGCAGAGTGG 2 50 

AGGCTGAGGA TCTGGGACTT TATTTCTGCT CTCAAAGTAC ACATGTTCCG 3 00 

CTCACGTTCG GTGCTGGGAC CAAGCTGGAG CTGAAACGGG CTGATGCTGC 3 50 

ACCAACTGTA TCCATCTTCC CACCATCCAG TGAGCAATTG A 3 91 



(2) INFORMATION FOR SEQ ID NO : 3 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 amino acids 
{ B } TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 5 : 

Asp lie Val Met Thr Gin Thr Pro Leu Ser Leu Pro Val Ser Leu 
15 10 15 

Gly Asp Gin Ala Ser lie Ser Cys Arg Ser Ser Gin Ser Leu Val 
20 25 30 

His Gly lie Gly Asn Thr Tyr Leu His Trp Tyr Leu Gin Lys Pro 
3 5 4 0 4 5 

Gly Gin Ser Pro Lys Leu Leu lie Tyr Lys Val Ser Asn Arg Phe 
50 55 60 

Ser Gly Val Pro Asp Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp 
65 70 75 

Phe Thr Leu Arg lie Ser Arg Val Glu Ala Glu Asp Leu Gly Leu 
80 85 90 

Tyr Phe Cys Ser Gin Ser Thr His Val Pro Leu Thr Phe Gly Ala 
95 100 105 

Gly Thr Lys Leu Glu Leu Lys Arg Ala Asp Ala Ala Pro Thr Val 
110 115 120 

Ser lie Phe Pro Pro Ser Ser Glu Gin Leu Lys 
125 130 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 405 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 



GAGATTCAGC TGCAGCAGTC TGGACCTGAG CTGATGAAGC CTGGGGCTTC 5 0 

AGTGAAGATA TCCTGCAAGG CTTCTGGTTA TTCATTCAGT AGCCACTACA 10 0 

TGCACTGGGT GAAGCAGAGC CATGGAAAGA GCCTTGAGTG GATTGGCTAC 15 0 

ATTGATCCTT CCAATGGTGA AACTACTTAC AACCAGAAAT TCAAGGGCAA 200 

GGCCACATTG ACTGTAGACA CATCTTCCAG CACAGCCAAC GTGCATCTCA 2 50 



GCAGCCTGAC ATCTGATGAC TCTGCAGTCT ATTTCTGTGC AAGAGGGGAC 300 
TATAGATACA ACGGCGACTG GTTTTTCGAT GTCTGGGGCG CAGGGACCAC 3 50 
GGTCACCGTC TCCTCCGCCA AAACCGACAG CCCCATCGGT CTATCCGGGC 4 00 
CCATC 4 05 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 135 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 7 : 

Glu He Gin Leu Gin Gin Ser Gly Pro Glu Leu Met Lys Pro Gly 
15 10 15 

Ala Ser Val Lys He Ser Cys Lys Ala Ser Gly Tyr Ser Phe Ser 
20 25 30 

Ser His Tyr Met His Trp Val Lys Gin Ser His Gly Lys Ser Leu 
35 40 45 

Glu Trp He Gly Tyr He Asp Pro Ser Asn Gly Glu Thr Thr Tyr 
50 55 60 

Asn Gin Lys Phe Lys Gly Lys Ala Thr Leu Thr Val Asp Thr Ser 
65 70 75 

Ser Ser Thr Ala Asn Val His Leu Ser Ser Leu Thr Ser Asp Asp 
80 85 90 

Ser Ala Val Tyr Phe Cys Ala Arg Gly Asp Tyr Arg Tyr Asn Gly 
95 100 105 

Asp Trp Phe Phe Asp Val Trp Gly Ala Gly Thr Thr Val Thr Val 
110 115 120 

Ser Ser Ala Lys Thr Asp Ser Pro He Gly Leu Ser Gly Pro He 
125 130 135 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO : 3 8 : 



CTTGGTGGAG GCGGAGGAGA CG 2 2 



(2) INFORMATION FOR SEQ ID NO : 3 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 9 : 



GAAACGGGCT GTTGCTGCAC CAACTGTATT CATCTTCC 3 8 
(2) INFORMATION FOR SEQ ID NO : 4 0 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: Nucleic Acid 
<C) STRANDEDNESS : Single 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 0 : 



GTCACCGTCT CCTCCGCCTC CACCAAGGGC C 31 
(2) INFORMATION FOR SEQ ID NO : 4 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 729 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 1 : 



ATGAAGAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 5 0 

TGCTACAAAT GCATACGCTG ATATCGTGAT GACACAGACA CCACTCTCCC 10 0 

TGCCTGTCAG TCTTGGAGAT CAGGCCTCCA TCTCTTGCAG ATCTAGTCAG 15 0 

AGCCTTGTAC ACGGTATTGG AAACACCTAT TTACATTGGT ACCTGCAGAA 2 00 

GCCAGGCCAG TCTCCAAAGC TCCTGATCTA CAAAGTTTCC AACCGATTTT 2 50 

CTGGGGTCCC AGACAGGTTC AGTGGCAGTG GATCAGGGAC AGATTTCACA 3 00 

CTCAGGATCA GCAGAGTGGA GGCTGAGGAT CTGGGACTTT ATTTCTGCTC 3 50 

TCAAAGTACA CATGTTCCGC TCACGTTCGG TGCTGGGACC AAGCTGGAGC 4 00 

TGAAACGGGC TGTTGCTGCA CCAACTGTAT TCATCTTCCC AC CAT C C AG T 4 50 



GAGCAATTGA AATCTGGAAC TGCCTCTGTT GTGTGCCTGC TGAATAACTT 500 



CTATCCCAGA GAGGCCAAAG TACAGTGGAA GGTGGATAAC GCCCTCCAAT 55 0 
CGGGTAACTC CCAGGAGAGT GTCACAGAGC AGGACAGCAA GGACAGCACC 600 
TACAGCCTCA GCAGCACCCT GACGCTGAGC AAAGCAGACT ACGAGAAACA 6 50 
CAAAGTCTAC GCCTGCGAAG TCACCCATGA GGGCCTGAGC TCGCCCGTCA 700 
CAAAGAGCTT CAACAGGGGA GAGTGTTAA 72 9 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
15 10 15 

Ser lie Ala Thr Asn Ala Tyr Ala Asp lie Val Met Thr Gin Thr 
20 25 30 

Pro Leu Ser Leu Pro Val Ser Leu Gly Asp Gin Ala Ser lie Ser 
35 40 45 

Cys Arg Ser Ser Gin Ser Leu Val His Gly lie Gly Asn Thr Tyr 
50 55 60 

Leu His Trp Tyr Leu Gin Lys Pro Gly Gin Ser Pro Lys Leu Leu 
65 70 75 

lie Tyr Lys Val Ser Asn Arg Phe Ser Gly Val Pro Asp Arg Phe 
80 85 90 

Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Arg lie Ser Arg 
95 100 105 

Val Glu Ala Glu Asp Leu Gly Leu Tyr Phe Cys Ser Gin Ser Thr 
110 115 120 

His Val Pro Leu Thr Phe Gly Ala Gly Thr Lys Leu Glu Leu Lys 
125 130 135 

Arg Ala Val Ala Ala Pro Thr Val Phe lie Phe Pro Pro Ser Ser 
140 145 150 

Glu Gin Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn 
155 160 165 

Asn Phe Tyr Pro Arg Glu Ala Lys Val Gin Trp Lys Val Asp Asn 
170 175 180 

Ala Leu Gin Ser Gly Asn Ser Gin Glu Ser Val Thr Glu Gin Asp 



185 190 195 

Ser Lys Asp Ser Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser 
200 205 210 

Lys Ala Asp Tyr Glu Lys His Lys Val Tyr Ala Cys Glu Val Thr 
215 220 225 

His Gin Gly Leu Ser Ser Pro Val Thr Lys Ser Phe Asn Arg Gly 
230 235 240 

Glu Cys 



(2) INFORMATION FOR SEQ ID NO : 4 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 762 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

ATGAAAAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 5 0 

TGCTACAAAC GCGTACGCTG AGATTCAGCT GCAGCAGTCT GGACCTGAGC 10 0 

TGATGAAGCC TGGGGCTTCA GTGAAGATAT CCTGCAAGGC TTCTGGTTAT 15 0 

TCATTCAGTA GCCACTACAT GCACTGGGTG AAG C AG AG CC ATGGAAAGAG 2 00 

CCTTGAGTGG ATTGGCTACA TTGATCCTTC CAATGGTGAA ACTACTTACA 2 50 

ACCAGAAATT CAAGGGCAAG GCCACATTGA CTGTAGACAC ATCTTCCAGC 300 

ACAGCCAACG TGCATCTCAG CAGCCTGACA TCTGATGACT CTGCAGTCTA 3 50 

TTTCTGTGCA AGAGGGGACT ATAGATACAA CGGCGACTGG TTTTTCGATG 4 00 

TCTGGGGCGC AGGGACCACG GTCACCGTCT CCTCCGCCTC CACCAAGGGC 4 50 

CCATCGGTCT TCCCCCTGGC ACCCTCCTCC AAG AG C AC C T CTGGGGGCAC 50 0 

AGCGGCCCTG GGCTGCCTGG TCAAGGACTA CTTCCCCGAA CCGGTGACGG 550 

TGTCGTGGAA CTCAGGCGCC CTGACCAGCG GCGTGCACAC CTTCCCGGCT 600 

GTCCTACAGT CCTCAGGACT CTACTCCCTC AGCAGCGTGG TGACCGTGCC 650 

CTCCAGCAGC TTGGGCACCC AGACCTACAT CTGCAACGTG AATCACAAGC 700 

CCAGCAACAC CAAGGTGGAC AAGAAAGTTG AGCCCAAATC TTGTGACAAA 7 50 
ACTCACACAT GA 7 62 
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(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 253 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 

15 10 15 

Ser lie Ala Thr Asn Ala Tyr Ala Glu lie Gin Leu Gin Gin Ser 

20 25 30 

Gly Pro Glu Leu Met Lys Pro Gly Ala Ser Val Lys lie Ser Cys 

35 40 45 

Lys Ala Ser Gly Tyr Ser Phe Ser Ser His Tyr Met His Trp Val 

50 55 60 

Lys Gin Ser His Gly Lys Ser Leu Glu Trp lie Gly Tyr lie Asp 

65 70 75 

Pro Ser Asn Gly Glu Thr Thr Tyr Asn Gin Lys Phe Lys Gly Lys 

80 85 90 

Ala Thr Leu Thr Val Asp Thr Ser Ser Ser Thr Ala Asn Val His 

95 100 105 

Leu Ser Ser Leu Thr Ser Asp Asp Ser Ala Val Tyr Phe Cys Ala 

110 115 120 

Arg Gly Asp Tyr Arg Tyr Asn Gly Asp Trp Phe Phe Asp Val Trp 

125 130 135 

Gly Ala Gly Thr Thr Val Thr Val Ser Ser Ala Ser Thr Lys Gly 

140 145 150 

Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly 

155 160 165 

Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu 

170 175 180 

Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val 

185 190 195 

His Thr Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser Leu 

200 205 21C 

Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gin Thr 

215 220 225 

Tyr lie Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp 

230 235 240 
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Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr 
245 250 

(2) INFORMATION FOR SEQ ID NO : 4 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 5 : 

Asp lie Val Met Thr Gin Thr Pro Leu Ser Leu Pro Val Ser Leu 
15 10 15 

Gly Asp Gin Ala Ser lie Ser Cys Arg Ser Ser Gin Ser Leu Val 
20 25 30 

His Gly lie Gly Asn Thr Tyr Leu His Trp Tyr Leu Gin Lys Pro 
35 40 45 

Gly Gin Ser Pro Lys Leu Leu lie Tyr Tyr Lys Val Ser Asn Arg 
50 55 60 

Phe Ser Gly Val Pro Asp Arg Phe Ser Asp Ser Gly Ser Gly Thr 
65 70 75 

Asp Phe Thr Leu Arg lie Ser Arg Val Glu Ala Glu Asp Leu Gly 
80 85 90 

Leu Tyr Phe Cys Ser Gin Ser Thr His Val Pro Leu Thr Phe Gly 
95 100 105 

Ala Gly Thr Lys Leu Glu Leu Lys Arg 
110 

(2) INFORMATION FOR SEQ ID NO : 4 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Asp lie Gin Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val 
15 10 15 

Gly Asp Arg Val Thr lie Thr Cys Arg Ser Ser Gin Ser Leu Val 
20 25 30 

His Gly lie Gly Asn Thr Tyr Leu His Trp Tyr Gin Gin Lys Pro 
35 40 45 



Gly Lys Ala Pro Lys Leu Leu lie Tyr Tyr Lys Val Ser Asn Arg 

50 55 60 



Phe Ser Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr 

65 70 75 

Asp Phe Thr Leu Thr lie Ser Ser Leu Gin Pro Glu Asp Phe Ala 

80 85 90 

Thr Tyr Tyr Cys Ser Gin Ser Thr His Val Pro Leu Thr Phe Gly 

95 100 105 

Gin Gly Thr Lys Val Glu lie Lys Arg 
110 

(2) INFORMATION FOR SEQ ID NO : 4 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 7 : 

Asp lie Gin Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val 

15 10 15 

Gly Asp Arg Val Thr lie Thr Cys Arg Ala Ser Lys Thr lie Ser 

20 25 30 

Lys Tyr Leu Ala Trp Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys 

35 40 45 

Leu Leu lie Tyr Tyr Ser Gly Ser Thr Leu Glu Ser Gly Val Pro 

50 55 60 

Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr 

65 70 75 

lie Ser Ser Leu Gin Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Gin 

80 85 90 

Gin His Asn Glu Tyr Pro Leu Thr Phe Gly Gin Gly Thr Lys Val 

95 100 105 

Glu lie Lys Arg 



(2) INFORMATION FOR SEQ ID NO : 4 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: Amino Acid 
( D } TO POLOG Y : Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 8 : 



Glu He Gin Leu Gin Gin Ser Gly Pro Glu Leu Met Lys Pro Gly 

15 10 15 



Ala Ser Val Lys lie Ser Cys Lys Ala Ser Gly Tyr Ser Phe Ser 

20 25 30 

Ser His Tyr Met His Trp Val Lys Gin Ser His Gly Lys Ser Leu 

3 5 4 0 4 5 

Glu Trp lie Gly Tyr lie Asp Pro Ser Asn Gly Glu Thr Thr Tyr 

50 55 60 

Asn Gin Lys Phe Lys Gly Lys Ala Thr Leu Thr Val Asp Thr Ser 

65 70 75 

Ser Ser Thr Ala Asn Val His Leu Ser Ser Leu Thr Ser Asp Asp 

80 85 90 

Ser Ala Val Tyr Phe Cys Ala Ala Arg Gly Asp Tyr Arg Tyr Asn 

95 100 105 

Gly Asp Trp Phe Phe Asp Val Trp Gly Ala Gly Thr 
110 115 

(2) INFORMATION FOR SEQ ID NO : 4 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 9 : 

Glu Val Gin Leu Val Glu Ser Gly Gly Gly Leu Val Gin Pro Gly 

15 10 15 

Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Tyr Ser Phe Ser 

20 25 30 

Ser His Tyr Met His Trp Val Arg Gin Ala Pro Gly Lys Gly Leu 

35 40 45 

Glu Trp Val Gly Tyr lie Asp Pro Ser Asn Gly Glu Thr Thr Tyr 

50 55 60 

Asn Gin Lys Phe Lys Gly Arg Phe Thr lie Ser Arg Asp Asn Ser 

65 70 75 

Lys Asn Thr Leu Tyr Leu Gin Met Asn Ser Leu Arg Ala Glu Asp 

80 85 90 

Thr Ala Val Tyr Tyr Cys Ala Ala Arg Gly Asp Tyr Arg Tyr Asn 

95 100 105 

Gly Asp Trp Phe Phe Asp Val Trp Gly Gin Gly Thr 
110 115 

(2) INFORMATION FOR SEQ ID NO : 5 0 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 Amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY : Linear 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Glu Val Gin Leu Val Glu Ser Gly Gly Gly Leu Val Gin Pro Gly 
15 10 15 

Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Ser Phe Thr 
20 25 30 

Gly His Trp Met Asn Trp Val Arg Gin Ala Pro Gly Lys Gly Leu 
35 40 45 

Glu Trp Val Gly Met lie His Pro Ser Asp Ser Glu Thr Arg Tyr 
50 55 60 

Ala Asp Ser Val Lys Gly Arg Phe Thr lie Ser Arg Asp Asn Ser 
65 70 75 

Lys Asn Thr Leu Tyr Leu Gin Met Asn Ser Leu Arg Ala Glu Asp 
80 85 90 

Thr Ala Val Tyr Tyr Cys Ala Ala Arg Gly lie Tyr Phe Tyr Gly 
95 100 105 

Thr Thr Tyr Phe Asp Tyr Trp Gly Gin Gly Thr 
110 115 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
15 10 15 

Ser lie Ala Thr Asn Ala Tyr Ala Asp lie Gin Met Thr Gin Ser 

20 25 30 

Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg Val Thr lie Thr 
35 40 45 

Cys Arg Ser Ser Gin Ser Leu Val His Gly lie Gly Asn Thr Tyr 

50 55 60 

Leu His Trp Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu 
65 70 75 

lie Tyr Lys Val Ser Asn Arg Phe Ser Gly Val Pro Ser Arg Phe 

80 85 90 
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Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr lie Ser Ser 

95 100 105 

Leu Gin Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Ser Gin Ser Thr 

110 115 120 

His Val Pro Leu Thr Phe Gly Gin Gly Thr Lys Val Glu lie Lys 

125 130 135 

Arg Thr Val Ala Ala Pro Ser Val Phe lie Phe Pro Pro Ser Asp 

140 145 150 

Glu Gin Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn 

155 160 165 

Asn Phe Tyr Pro Arg Glu Ala Lys Val Gin Trp Lys Val Asp Asn 

170 175 180 

Ala Leu Gin Ser Gly Asn Ser Gin Glu Ser Val Thr Glu Gin Asp 

185 190 195 

Ser Lys Asp Ser Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser 

200 205 210 

Lys Ala Asp Tyr Glu Lys His Lys Val Tyr Ala Cys Glu Val Thr 

215 220 225 

His Gin Gly Leu Ser Ser Pro Val Thr Lys Ser Phe Asn Arg Gly 

230 235 240 



Glu Cys 



(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 253 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
15 10 15 

Ser lie Ala Thr Asn Ala Tyr Ala Glu Val Gin Leu Val Gin Ser 
20 25 30 

Gly Gly Gly Leu Val Gin Pro Gly Gly Ser Leu Arg Leu Ser Cys 
35 40 45 

Ala Ala Ser Gly Tyr Ser Phe Ser Ser His Tyr Met His Trp Val 
50 55 60 

Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val Gly Tyr lie Asp 

65 -70 75 
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Pro Ser Asn Gly Glu Thr Thr Tyr Asn Gin Lys Phe Lys Gly Arg 
80 85 90 

Phe Thr Leu Ser Arg Asp Asn Ser Lys Asn Thr Ala Tyr Leu Gin 
95 100 105 

Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala 
110 115 120 

Arg Gly Asp Tyr Arg Tyr Asn Gly Asp Trp Phe Phe Asp Val Trp 
125 130 135 

Gly Gin Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly 
140 145 150 

Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly 
155 160 165 

Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu 
170 175 180 

Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val 
185 190 195 

His Thr Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser Leu 
200 205 210 

Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gin Thr 
215 220 225 

Tyr lie Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp 
230 235 240 

Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr 
245 250 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Ser Gly Gly Gly Ser Gly Ser Gly Asp Phe Asp Tyr Glu Lys Met 
15 10 15 

Ala Asn Ala Asn Lys Gly Ala Met Thr Glu Asn Ala Asp Glu Asn 

20 25 30 

Ala Leu Gin Ser Asp Ala Lys Gly Lys Leu Asp Ser Val Ala Thr 
35 40 45 

Asp Tyr Gly Ala Ala lie Asp Gly Phe lie Gly Asp Val Ser Gly 

50 55 60 
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Leu Ala Asn Gly Asn Gly Ala Thr Gly Asp Phe Ala Gly Ser Ser 
65 70 75 

Asn Ser Gin Met Ala Gin Val Gly Asp Gly Asp Asn Ser Pro Leu 
80 85 90 

Met Asn Asn Phe Arg Gin Tyr Leu Pro Ser Leu Pro Gin Ser Val 
95 100 105 

Glu Cys Arg Pro Phe Val Phe Ser Ala Gly Lys Pro Tyr Glu Phe 
110 115 120 

Ser lie Asp Cys Asp Lys lie Asn Leu Phe Arg Gly Val Phe Ala 
125 130 135 

Phe Leu Leu Tyr Val Ala Thr Phe Met Tyr Val Phe Ser Thr Phe 
140 145 150 

Ala Asn lie Leu Arg Asn Lys Glu Ser 
155 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 780 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 



ATGAAAAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 5 0 

TGCTACAAAC GCATACGCTG ATATCCAGAT GACCCAGTCC CCGAGCTCCC 100 

TGTCCGCCTC TGTGGGCGAT AGGGTCACCA TCACCTGCAG GTCAAGTCAA 15 0 

AGCTTAGTAC ATGGTATAGG TAACACGTAT TTACACTGGT ATCAACAGAA 2 00 

ACCAGGAAAA GCTCCGAAAC TACTGATTTA CAAAGTATCC AATCGATTCT 2 50 

CTGGAGTCCC TTCTCGCTTC TCTGGATCCG GTTCTGGGAC GGATTTCACT 300 

CTGACCATCA GCAGTCTGCA GCCAGAAGAC TTCGCAACTT ATTACTGTTC 3 50 

ACAGAGTACT CATGTCCCGC TCACGTTTGG ACAGGGTACC AAGGTGGAGA 400 

TCAAACGAAC TGTGGCTGCA CCATCTGTCT TCATCTTCCC GCCATCTGAT 4 50 

GAGCAGTTGA AATCTGGAAC TGCTTCTGTT GTGTGCCTGC TGAATAACTT 500 

CTATCCCAGA GAGGCCAAAG TACAGTGGAA GGTGGATAAC GCCCTCCAAT 55 0 

CGGGTAACTC CCAGGAGAGT GTCACAGAGC AGGACAGCAA GGACAGCACC 6 00 
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TACAGCCTCA GCAGCACCCT GACGCTGAGC AAAGCAGACT ACGAGAAACA 65 0 
CAAAGTCTAC GCCTGCGAAG TCACCCATCA GGGCCTGAGC TCGCCCGTCA 700 
CAAAGAGCTT CAACAGGGGA GAGTGTTAAG CTGATCCTCT ACGCCGGACG 7 50 
CATCGTGGCC CTAGTACGCA ACTAGTCGTA 78 0 
{2} INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 253 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
15 10 15 

Ser lie Ala Thr Asn Ala Tyr Ala Glu Val Gin Leu Val Glu Ser 
20 25 30 

Gly Gly Gly Leu Val Gin Pro Gly Gly Ser Leu Arg Leu Ser Cys 
35 40 45 

Ala Ala Ser Gly Tyr Ser Phe Ser Ser His Tyr Met His Trp Val 
50 55 60 

Lys Gin Ala Pro Gly Lys Gly Leu Glu Trp Val Gly Tyr lie Asp 
65 70 75 

Pro Ser Asn Gly Glu Thr Thr Tyr Asn Gin Lys Phe Lys Gly Arg 
80 85 90 

Phe Thr Leu Ser Arg Asp Asn Ser Lys Asn Thr Ala Tyr Leu Gin 
95 100 105 

Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala 
110 115 120 

Arg Gly Asp Tyr Arg Tyr Asn Gly Asp Trp Phe Phe Asp Val Trp 
125 130 135 

Gly Gin Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly 
140 145 150 

Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly 
155 160 165 

Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu 
170 175 180 

Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val 
185 190 195 

His Thr Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser Leu 
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4 * 

200 205 210 

Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gin Thr 

215 220 225 

Tyr lie Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp 

230 235 2 40 

Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr 

245 250 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 

15 10 15 

Ser lie Ala Thr Asn Ala Tyr Ala Asp lie Gin Met Thr Gin Ser 

20 25 30 

Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg Val Thr lie Thr 

35 40 45 

Cys Arg Ser Ser Gin Ser Leu Val His Gly lie Gly Ala Thr Tyr 

50 55 60 

Leu His Tx-p Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu 

65 70 75 

lie Tyr Lys Val Ser Asn Arg Phe Ser Gly Val Pro Ser Arg Phe 

80 85 90 

Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr lie Ser Ser 

95 100 105 

Leu Gin Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Ser Gin Ser Thr 

110 115 120 

His Val Pro Leu Thr Phe Gly Gin Gly Thr Lys Val Glu lie Lys 

125 130 135 

Arg Thr Val Ala Ala Pro Ser Val Phe lie Phe Pro Pro Ser Asp 

140 145 150 

Glu Gin Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn 

155 160 165 

Asn Phe Tyr Pro Arg Glu Ala Lys Val Gin Trp Lys Val Asp Asn 

170 175 180 

Ala Leu Gin Ser Gly Asn Ser Gin Glu Ser Val Thr Glu Gin Asp 
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185 



190 
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Ser Lys Asp Ser Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser 

200 205 210 

Lys Ala Asp Tyr Glu Lys His Lys Val Tyr Ala Cys Glu Val Thr 

215 220 225 

His Gin Gly Leu Ser Ser Pro Val Thr Lys Ser Phe Asn Arg Gly 

230 235 240 



Glu Cys 



(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Arg Met Lys 
15 10 15 

Gin Leu Glu Asp Lys Val Glu Glu Leu Leu Ser Lys Asn Tyr His 
20 25 30 

Leu Glu Asn Glu Val Ala Arg Leu Lys Lys Leu Val Gly Glu Arg 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 780 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 



ATGAAAAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 5 0 
TGCTACAAAC GCATACGCTG ATATCCAGAT GACCCAGTCC CCGAGCTCCC 100 
TGTCCGCCTC TGTGGGCGAT AGGGTCACCA TCACCTGCAG GTCAAGTCAA 15 0 
AGCTTAGTAC ATGGTATAGG TGCTACGTAT TTACACTGGT ATCAACAGAA 20 0 
ACCAGGAAAA GCTCCGAAAC TACTGATTTA CAAAGTATCC AATCGATTCT 2 50 
CTGGAGTCCC TTCTCGCTTC TCTGGATCCG GTTCTGGGAC GGATTTCACT 3 00 



CTGACCATCA GCAGTCTGCA GCCAGAAGAC TTCGCAACTT ATTACTGTTC 3 50 



ACAGAGTACT CATGTCCCGC TCACGTTTGG ACAGGGTACC AAGGTGGAGA 4 00 

TCAAACGAAC TGTGGCTGCA CCATCTGTCT TCATCTTCCC GCCATCTGAT 4 50 

GAGCAGTTGA AATCTGGAAC TGCTTCTGTT GTGTGCCTGC TGAATAACTT 500 

CTATCCCAGA GAGGCCAAAG TACAGTGGAA GGTGGATAAC GCCCTCCAAT 5 50 

CGGGTAACTC CCAGGAGAGT GTCACAGAGC AGGACAGCAA GGACAGCACC 6 00 

TACAGCCTCA GCAGCACCCT GACGCTGAGC AAAGCAGACT ACGAGAAACA 6 50 

CAAAGTCTAC GCCTGCGAAG TCACCCATCA GGGCCTGAGC TCGGCCGTCA 7 00 

CAAAGAGCTT CAACAGGGGA GAGTGTTAAG CTGATCCTCT ACGCCGGACG 7 50 
CATCGTGGCC CTAGTACGCA ACTAGTCGTA 780 
(2) INFORMATION FOR SEQ ID NO : 5 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 927 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

AAAAGGGTAT CTAGAGGTTG AGGTGATTTT AT G AAAAAG A AT AT C G CAT T 5 0 

TCTTCTTGCA TCTATGTTCG TTTTTTCTAT TGCTACAAAC GCGTACGCTG 100 

AGGTTCAGCT AGTGCAGTCT GGCGGTGGCC TGGTGCAGCC AGGGGGCTCA 15 0 

CTCCGTTTGT CCTGTGCAGC TTCTGGCTAC TCCTTCTCGA GTCACTATAT 2 00 

GCACTGGGTC CGTCAGGCCC CGGGTAAGGG CCTGGAATGG GTTGGATATA 2 50 

TTGATCCTTC CAATGGTGAA ACTACGTATA ATCAAAAGTT CAAGGGCCGT 3 00 

TTCACTTTAT CTCGCGACAA CTCCAAAAAC ACAGCATACC TGCAGATGAA 3 50 

CAGCCTGCGT GCTGAGGACA CTGCCGTCTA TTACTGTGCA AGAGGGGATT 4 00 

ATCGCTACAA TGGTGACTGG TTCTTCGACG TCTGGGGTCA AGGAACCCTG 4 50 

GTCACCGTCT CCTCGGCCTC CACCAAGGGC CCATCGGTCT TCCCCCTGGC 500 

ACCCTCCTCC AAGAGCACCT CTGGGGGCAC AGCGGCCCTG GGCTGCCTGG 55 0 

TCAAGGACTA CTTCCCCGAA CCGGTGACGG TGTCGTGGAA CTCAGGCGCC 600 

CTGACCAGCG GCGTGCACAC CTTCCCGGCT GTCCTACAGT CCTCAGGACT 6 50 

CTACTCCCTC AGCAGCGTGG TGACCGTGCC CTCCAGCAGC TTGGGCACCC 700 



AGACCTACAT CTGCAACGTG AATCACAAGC CCAGCAACAC CAAGGTCGAC 750 
AAGAAAGTTG AGCCCAAATC TTGTGACAAA ACTCACACAT GCCCGCCGTG 80 0 
CCCAGCACCA GAACTGCTGG GCGGCCGCAT GAAACAGCTA GAGGACAAGG 850 
TCGAAGAGCT ACTCTCCAAG AACTACCACC TAGAGAATGA AGTGGCAAGA 90 0 
CTCAAAAAGC TTGTCGGGGA GCGCTAA 92 7 
(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 98 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 0 : 

Met Lys Lys Asn He Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
15 10 15 

Ser He Ala Thr Asn Ala Tyr Ala Glu Val Gin Leu Val Gin Ser 
20 25 30 

Gly Gly Gly Leu Val Gin Pro Gly Gly Ser Leu Arg Leu Ser Cys 
35 40 45 

Ala Ala Ser Gly Tyr Ser Phe Ser Ser His Tyr Met His Trp Val 
50 55 60 

Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val Gly Tyr He Asp 
65 70 75 

Pro Ser Asn Gly Glu Thr Thr Tyr Asn Gin Lys Phe Lys Gly Arg 
8 0 8 5 90 

Phe Thr Leu Ser Arg Asp Asn Ser Lys Asn Thr Ala Tyr Leu Gin 
95 100 105 

Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala 
110 115 120 

Arg Gly Asp Tyr Arg Tyr Asn Gly Asp Trp Phe Phe Asp Val Trp 
125 130 135 

Gly Gin Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly 
140 145 150 

Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly 
155 160 165 

Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu 
170 175 180 

Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val 

185 190 195 



His Thr Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser Leu 
200 205 210 

Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gin Thr 
215 220 225 

Tyr lie Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp 
230 235 240 

Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro 
245 250 255 

Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Arg Met Lys Gin Leu 
260 265 270 

Glu Asp Lys Val Glu Glu Leu Leu Ser Lys Asn Tyr His Leu Glu 
275 280 285 

Asn Glu Val Ala Arg Leu Lys Lys Leu Val Gly Glu Arg 
290 295 

(2) INFORMATION FOR SEQ ID NO : 6 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6563 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 



GAATTCAACT TCTCCATACT TTGGATAAGG AAATACAGAC ATGAAAAATC 5 0 
TCATTGCTGA GTTGTTATTT AAGCTTGCCC AAAAAGAAGA AGAGTCGAAT 100 
GAACTGTGTG CGCAGGTAGA AGCTTTGGAG ATTATCGTCA CTGCAATGCT 15 0 
TCGCAATATG GCGCAAAATG ACCAACAGCG GTTGATTGAT CAGGTAGAGG 200 
GGGCGCTGTA CGAGGTAAAG CCCGATGCCA GCATTCCTGA CGACGATACG 25 0 
GAGCTGCTGC GCGATTACGT AAAGAAGTTA TTGAAGCATC CTCGTCAGTA 3 00 
AAAAGTTAAT CTTTTCAACA GCTGTCATAA AGTTGTCACG GCCGAGACTT 3 50 
ATAGTCGCTT TGTTTTTATT TTTTAATGTA TTTGTAACTA GAATTCGAGC 400 
TCGGTACCCG GGGATCCTCT CGAGGTTGAG GTGATTTTAT GAAAAAGAAT 450 
ATCGCATTTC TTCTTGCATC TATGTTCGTT TTTTCTATTG CTACAAACGC 50 0 
ATACGCTGAT ATCCAGATGA CCCAGTCCCC GAGCTCCCTG TCCGCCTCTG 55 0 
TGGGCGATAG GGTCACCATC ACCTGCAGGT CAAGTCAAAG CTTAGTACAT 60 0 
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GGTATAGGTG CTACGTATTT ACACTGGTAT CAACAGAAAC CAGGAAAAGC 6 50 

TCCGAAACTA CTGATTTACA AAGTATCCAA TCGATTCTCT GGAGTCCCTT 70 0 

CTCGCTTCTC TGGATCCGGT TCTGGGACGG ATTTCACTCT GACCATCAGC 75 0 

AGTCTGCAGC CAGAAGACTT CGCAACTTAT TACTGTTCAC AGAGTACTCA 80 0 

TGTCCCGCTC ACGTTTGGAC AGGGTACCAA GGTGGAGATC AAACGAACTG 8 50 

TGGCTGCACC ATCTGTCTTC ATCTTCCCGC CATCTGATGA GCAGTTGAAA 90 0 

TCTGGAACTG CTTCTGTTGT GTGCCTGCTG AATAACTTCT ATCCCAGAGA 95 0 

GGCCAAAGTA CAGTGGAAGG TGGATAACGC CCTCCAATCG GGTAACTCCC 10 0 0 

AGGAGAGTGT C AC AG AG C AG GACAGCAAGG ACAGCACCTA CAGCCTCAGC 10 50 

AGCACCCTGA CGCTGAGCAA AGCAGACTAC GAGAAACACA AAGTCTACGC 110 0 

CTGCGAAGTC ACCCATCAGG GCCTGAGCTC GCCCGTCACA AAGAGCTTCA 115 0 

ACAGGGGAGA GTGTTAAGCT GATCCTCTAC GCCGGACGCA TCGTGGCCCT 12 0 0 

AGTACGCAAC TAGTCGTAAA AAGGGTATCT AGAGGTTGAG GTGATTTTAT 12 5 0 

GAAAAAGAAT ATCGCATTTC TTCTTGCATC TATGTTCGTT TTTTCTATTG 13 0 0 

CTACAAACGC GTACGCTGAG GTTCAGCTAG TGCAGTCTGG CGGTGGCCTG 13 5 0 

GTGCAGCCAG GGGGCTCACT CCGTTTGTCC TGTGCAGCTT CTGGCTACTC 14 0 0 

CTTCTCGAGT CACTATATGC ACTGGGTCCG TCAGGCCCCG GGTAAGGGCC 14 5 0 

TGGAATGGGT TGGATATATT GATCCTTCCA ATGGTGAAAC TACGTATAAT 150 0 

CAAAAGTTCA AGGGCCGTTT CACTTTATCT CGCGACAACT C C AAAAAC AC 15 5 0 

AGCATACCTG CAGATGAACA GCCTGCGTGC TGAGGACACT GCCGTCTATT 16 0 0 

ACTGTGCAAG AGGGGATTAT CGCTACAATG GTGACTGGTT CTTCGACGTC 16 5 0 

TGGGGTCAAG GAACCCTGGT CACCGTCTCC TCGGCCTCCA CCAAGGGCCC 17 0 0 

ATCGGTCTTC CCCCTGGCAC CCTCCTCCAA GAGCACCTCT GGGGGCACAG 17 5 0 

CGGCCCTGGG CTGCCTGGTC AAGGACTACT TCCCCGAACC GGTGACGGTG 18 00 

TCGTGGAACT CAGGCGCCCT GACCAGCGGC GTGCACACCT TCCCGGCTGT 18 5 0 

CCTACAGTCC TCAGGACTCT ACTCCCTCAG CAGCGTGGTG ACCGTGCCCT 190 0 

CCAGCAGCTT GGGCACCCAG ACCTACATCT GCAACGTGAA TCACAAGCCC 195 0 

AGCAACACCA AGGTCGACAA GAAAGTTGAG CCCAAATCTT GTGACAAAAC 2 0 00 

TCACACATGC CCGCCGTGCC CAGCACCAGA ACTGCTGGGC GGCCGCATGA 2 0 50 
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AACAGCTAGA GGACAAGGTC GAAGAGCTAC 
GAGAATGAAG TGGCAAGACT CAAAAAGCTT 
CGACGGCCCT AGAGTCCCTA ACGCTCGGTT 
GTTAACTCAT GTTTGACAGC TTATCATCGA 
TATCACAGTT AAATTGCTAA CGCAGTCAGG 
AATGCGCTCA TCGTCATCCT CGGCACCGTC 
AGGCTTGGTT ATGCCGGTAC TGCCGGGCCT 
CCGACAGCAT CGCCAGTCAC TATGGCGTGC 
ATGCAATTTC TATGCGCACC CGTTCTCGGA 
CCGCCGCCCA GTCCTGCTCG CTTCGCTACT 
CGATCATGGC GACCACACCC GTCCTGTGGA 
GTGGCCGGCA TCACCGGCGC CACAGGTGCG 
CGACATCACC GATGGGGAAG ATCGGGCTCG 
CTTGTTTCGG CGTGGGTATG GTGGCAGGCC 
GGCGCCATCT CCTTGCACGC ACCATTCCTT 
CCTCAACCTA CTACTGGGCT GCTTCCTAAT 
AGCGTCGTCC GATGCCCTTG AGAGCCTTCA 
TGGGCGCGGG GCATGACTAT CGTCGCCGCA 
CATGCAACTC GTAGGACAGG TGCCGGCAGC 
AGGACCGCTT TCGCTGGAGC GCGACGATGA 
TTCGGAATCT TGCACGCCCT CGCTCAAGCC 
CAAACGTTTC GGCGAGAAGC AGGCCATTAT 
CGCTGGGCTA CGTCTTGCTG GCGTTCGCGA 
CCCATTATGA TTCTTCTCGC TTCCGGCGGC 
GGCCATGCTG TCCAGGCAGG TAGATGACGA 
GATCGCTCGC GGCTCTTACC AGCCTAACTT 
GTCACGGCGA TTTATGCCGC CTCGGCGAGC 
GATTGTAGGC GCCGCCCTAT ACCTTGTCTG 



TCTCCAAGAA CTACCACCTA 210 0 

GTCGGGGAGC GCTAAGCATG 2150 

GCCGCCGGGC GTTTTTTATT 22 0 0 

TAAGCTTTAA TGCGGTAGTT 22 5 0 

CACCGTGTAT GAAATCTAAC 2 3 00 

ACCCTGGATG CTGTAGGCAT 2 3 50 

CTTGCGGGAT ATCGTCCATT 24 0 0 

TGCTAGCGCT ATATGCGTTG 24 5 0 

GCACTGTCCG ACCGCTTTGG 2500 

TGGAGCCACT ATCGACTACG 2 55 0 

TCCTCTACGC CGGACGCATC 2600 

GTTGCTGGCG CCTATATCGC 2 6 50 

CCACTTCGGG CTCATGAGCG 2700 

CCGTGGCCGG GGGACTGTTG 2 7 50 

GCGGCGGCGG TGCTCAACGG 2800 

GC AG GAGTCG CATAAGGGAG 28 50 

ACCCAGTCAG CTCCTTCCGG 2 900 

CTTATGACTG TCTTCTTTAT 2 95 0 

GCTCTGGGTC ATTTTCGGCG 3 000 

TCGGCCTGTC GCTTGCGGTA 3 050 

TTCGTCACTG GTCCCGCCAC 310 0 

CGCCGGCATG GCGGCCGACG 315 0 

CGCGAGGCTG GATGGCCTTC 3 200 

ATCGGGATGC CCGCGTTGCA 32 5 0 

CCATCAGGGA CAGCTTCAAG 3 3 00 

CGATCACTGG ACCGCTGATC 3 3 50 

ACATGGAACG GGTTGGCATG 34 0 0 

CCTCCCCGCG TTGCGTCGCG 3 4 50 
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GTGCATGGAG CCGGGCCACC TCGACCTGAA TGGAAGCCGG CGGCACCTCG 3 50 0 
CTAACGGATT CACCAOTCCA AGAATTGGAG CCAATCAATT CTTGCGGAGA 3 55 0 
ACTGTGAATG CGCAAACCAA CCCTTGGCAG AACATATCCA TCGCGTCCGC 3600 
CATCTCCAGC AGCCGCACGC GGCGCATCTC GGGCAGCGTT GGGTCCTGGC 3650 
CACGGGTGCG CATGATCGTG CTCCTGTCGT TGAGGACCCG GCTAGGCTGG 3700 
CGGGGTTGCC TTACTGGTTA GCAGAATGAA TCACCGATAC GCGAGCGAAC 3 7 50 
GTGAAGCGAC TGCTGCTGCA AAACGTCTGC GACCTGAGCA ACAACATGAA 3800 
TGGTCTTCGG TTTCCGTGTT TCGTAAAGTC TGGAAACGCG GAAGTCAGCG 3 8 50 
CCCTGCACCA TTATGTTCCG GATCTGCATC GCAGGATGCT GCTGGCTACC 3 90 0 
CTGTGGAACA CCTACATCTG TATTAACGAA GCGCTGGCAT TGACCCTGAG 3 95 0 
TGATTTTTCT CTGGTCCCGC CGCATCCATA CCGCCAGTTG TTTACCCTCA 4 000 
CAACGTTCCA GTAACCGGGC ATGTTCATCA TCAGTAACCC GTATCGTGAG 4 050 
CATCCTCTCT CGTTTCATCG GTATCATTAC CCCCATGAAC AGAAATTCCC 4100 
CCTTACACGG AGGCATCAAG TGACCAAACA G G AAAAAAC C GCCCTTAACA 4150 
TGGCCCGCTT TATCAGAAGC CAGACATTAA CGCTTCTGGA GAAACTCAAC 420 0 
GAGCTGGACG CGGATGAACA GGCAGACATC TGTGAATCGC TTCACGACCA 4250 
CGCTGATGAG CTTTACCGCA GCTGCCTCGC GCGTTTCGGT GATGACGGTG 4 3 00 
AAAACCTCTG ACACATGCAG CTCCCGGAGA CGGTCACAGC TTGTCTGTAA 4 3 50 
GCGGATGCCG GGAGCAGACA AGCCCGTCAG GGCGCGTCAG CGGGTGTTGG 44 0 0 
CGGGTGTCGG GGCGCAGCCA TGACCCAGTC AGGTAGCGAT AGCGGAGTGT 44 5 0 
ATACTGGCTT AACTATGCGG GATCAGAGCA GATTGTACTG AGAGTGCACC 4 500 
ATATGCGGTG TGAAATACCG CACAGATGCG TAAGGAGAAA ATACCGCATC 4 5 50 
AGGCGCTCTT CCGCTTCCTC GCTCACTGAC TCGCTGCGCT CGGTCGTTCG 4 6 00 
GCTGCGGCGA GCGGTATCAG CTCACTCAAA GGCGGTAATA CGGTTATCCA 4 6 50 
CAGAATCAGG GGATAACGCA GGAAAGAACA TGTGAGCAAA AGGCCAGCAA 4 70 0 
AAGGCCAGGA ACCGTAAAAA GGCCGCGTTG CTGGCGTTTT TCCATAGGCT 4^50 
CCGCCCCCCT GACGAGCATC ACAAAAATCG ACGCTCAAGT CAGAGGTGGC 4800 
GAAACCCGAC AGGACTATAA AGATACCAGG CGTTTCCCCC TGGAAGCTCC 4 8 50 
CTCGTGCGCT CTCCTGTTCC GACCCTGCCG CTTACCGGAT ACCTGTCCGC 4 90 0 



CTTTCTCCCT TCGGGAAGCG TGGCGCTTTC 
ATCTCAGTTC GGTGTAGGTC GTTCGCTCCA 
CCCCCCGTTC AGCCCGACCG CTGCGCCTTA 
GTCCAACCCG GTAAGACACG ACTTATCGCC 
ACAGGATTAG CAGAGCGAGG TATGTAGGCG 
TGGTGGCCTA ACTACGGCTA CACTAGAAGG 
TCTGCTGAAG CCAGTTACCT TCGGAAAAAG 
GCAAACAAAC CACCGCTGGT AGCGGTGGTT 
ATTACGCGCA GAAAAAAAGG ATCTCAAGAA 
GGGGTCTGAC GCTCAGTGGA ACGAAAACTC 
TGAGATTATC AAAAAGGATC TTCACCTAGA 
AGTTTTAAAT CAATCTAAAG TATATATGAG 
CCAATGCTTA ATCAGTGAGG CACCTATCTC 
CATCCATAGT TGCCTGACTC CCCGTCGTGT 
GGCTTACCAT CTGGCCCCAG TGCTGCAATG 
ACCGGCTCCA GATTTATCAG CAATAAACCA 
GCAGAAGTGG TCCTGCAACT TTATCCGCCT 
TGCCGGGAAG CTAGAGTAAG TAGTTCGCCA 
TGTTGCCATT GCTGCAGGCA TCGTGGTGTC 
CTTCATTCAG CTCCGGTTCC CAACGATCAA 
ATGTTGTGCA AAAAAGCGGT TAGCTCCTTC 
AAGTAAGTTG GCCGCAGTGT TATCACTCAT 
ATTCTCTTAC TGTCATGCCA TCCGTAAGAT 
TACTCAACCA AGTCATTCTG AGAATAGTGT 
TTGCCCGGCG TCAACACGGG ATAATACCGC 
AAGTGCTCAT CATTGGAAAA CGTTCTTCGG 
TTACCGCTGT T G AG AT CC AG TTCGATGTAA 
ATCTTCAGCA TCTTTTACTT TCACCAGCGT 



TCATAGCTCA CGCTGTAGGT 4 95 0 

AGCTGGGCTG TGTGCACGAA 5000 

TCCGGTAACT ATCGTCTTGA 5050 

AGTGGCAGCA GCCACTGGTA 510 0 

GTGCTACAGA GTTCTTGAAG 515 0 

ACAGTATTTG GTATCTGCGC 52 0 0 

AGTTGGTAGC TCTTGATCCG 52 5 0 

TTTTTGTTTG CAAGCAGCAG 53 0 0 

GATCCTTTGA TCTTTTCTAC 53 50 

ACGTTAAGGG ATTTTGGTCA 54 0 0 

TCCTTTTAAA TTAAAAATGA 54 5 0 

TAAACTTGGT CTGACAGTTA 55 0 0 

AGCGATCTGT CTATTTCGTT 555 0 

AGATAACTAC GATACGGGAG 5600 

ATACCGCGAG ACCCACGCTC 5 65 0 

GCCAGCCGGA AGGGCCGAGC 57 0 0 

CCATCCAGTC TATTAATTGT 57 5 0 

GTTAATAGTT TGCGCAACGT 58 00 

ACGCTCGTCG TTTGGTATGG 5850 

GGCGAGTTAC ATGATCCCCC 5 90 0 

GGTCCTCCGA TCGTTGTCAG 5 95 0 

GGTTATGGCA GCACTGCATA 6 00 0 

GCTTTTCTGT GACTGGTGAG 6 0 50 

ATGCGGCGAC CGAGTTGCTC 6100 

GCCACATAGC AGAACTTTAA 6150 

GGCGAAAACT CTC7AAGGATC 62 00 

CCCACTCGTG CACCCAACTG 62 50 

TTCTGGGTGA GCAAAAACAG 63 00 



GAAGGCAAAA TGCCGCAAAA AAGGGAATAA GGGCGACACG GAAATGTTGA 6 3 50 

ATACTCATAC TCTTCCTTTT TCAATATTAT TGAAGCATTT ATCAGGGTTA 64 0 0 

TTGTCTCATG AGCGGATACA TATTTGAATG TATTTAGAAA AATAAACAAA 64 5 0 

TAGGGGTTCC GCGCACATTT CCCCGAAAAG TGCCACCTGA CGTCTAAGAA 6500 

ACCATTATTA TCATGACATT AAC C TAT AAA AATAGGCGTA TCACGAGGCC 655 0 
CTTTCGTCTT CAA 6 56 3 
(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

Met Lys Lys Asn He Ala Phe Leu Leu Ala Ser Met Phe Val Phe 

15 10 15 

Ser He Ala Thr Asn Ala Tyr Ala Asp He Gin Met Thr Gin Ser 

20 25 30 

Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg Val Thr He Thr 

35 40 45 

Cys Arg Ser Ser Gin Ser Leu Val His Gly He Gly Glu Thr Tyr 

50 55 60 

Leu His Trp Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu 

65 70 75 

He Tyr Lys Val Ser Asn Arg Phe Ser Gly Val Pro Ser Arg Phe 

80 85 90 

Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr He Ser Ser 

95 100 105 

Leu Gin Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Ser Gin Ser Thr 

110 115 120 

His Val Pro Leu Thr Phe Gly Gin Gly Thr Lys Val Glu He Lys 

125 130 135 

Arg Thr Val Ala Ala Pro Ser Val Phe lie Phe Pro Pro Ser Asp 

140 145 150 

Glu Gin Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn 

155 160 165 

Asn Phe Tyr Pro Arg Glu Ala Lys Val Gin Trp Lys Val Asp Asn 

170 175 180 
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Ala Leu Gin Ser Gly Asn Ser Gin Glu Ser Val Thr Glu Gin Asp 

185 190 195 

Ser Lys Asp Ser Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser 

200 205 210 

Lys Ala Asp Tyr Glu Lys His Lys Val Tyr Ala Cys Glu Val Thr 

215 220 225 

His Gin Gly Leu Ser Ser Pro Val Thr Lys Ser Phe Asn Arg Gly 

230 235 240 

Glu Cys 



(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 



CATGGTATAG GTTAAACTTA TTTACAC 2 7 
(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 



CATGGTATAG GTNNSACTTA TTTACAC 2 7 
(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 780 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 5 : 



ATGAAAAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 5 0 
TGCTACAAAC GCATACGCTG ATATCCAGAT GACCCAGTCC CCGAGCTCCC 10 0 
TGTCCGCCTC TGTGGGCGAT AGGGTCACCA TCACCTGCAG GTCAAGTCAA 150 
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AGCTTAGTAC ATGGTATAGG TGAGACGTAT TTACACTGGT ATCAACAGAA 200 

ACCAGGAAAA GCTCCGAAAC TACT3ATTTA CAAAGTATCC AATCGATTCT 250 

CTGGAGTCCC TTCTCGCTTC TCTGGATCCG GTTCTGGGAC GGATTTCACT 300 

CTGACCATCA GCAGTCTGCA GCCAGAAGAC TTCGCAACTT ATTACTGTTC 350 

ACAGAGTACT CATGTCCCGC TCACGTTTGG ACAGGGTACC AAGGTGGAGA 400 

TCAAACGAAC TGTGGCTGCA CCATCTGTCT TCATCTTCCC GCCATCTGAT 450 

GAGCAGTTGA AATCTGGAAC TGCTTCTGTT GTGTGCCTGC TGAATAACTT 500 

CTATCCCAGA GAGGCCAAAG TACAGTGGAA GGTGGATAAC GCCCTCCAAT 550 

GGGGTAACTC CCAGGAGAGT GTCACAGAGC AGGACAGCAA GGACAGCACC 600 

TACAGCCTCA GCAGCACCCT GACGCTGAGC AAAGCAGACT AC G AG AAAC A 650 

CAAAGTCTAC GCCTGCGAAG TCACCCATCA GGGCCTGAGC TCGCCCGTCA 7 00 

CAAAGAGCTT CAACAGGGGA GAGTGTTAAG CTGATCCTCT ACGCCGGACG 75 0 
CATCGTGGCC CTAGTACGCA ACTAGTCGTA 7 80 
(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

CTAGTGCAGT CTGGCGGTGG CCTGGTGCAG CCAGGGGGCT CACTCCGTTT 5 0 
GTCCTGTGCA GCTTCTGGCT ACTCCTTC 7 8 
(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 base pairs 

(B) TYPE: Nucleic Acid 
<C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 7 : 

TCGAGAAGGA GTAGCCAGAA GCTGCACAGG ACAAACGGAG TGAGCCCCCT 5 0 
GGCTGCACCA GGCCACCGCC AGACTGCACT AG 8 2 
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(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8120 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 



TTCGAGCTCG CCCGACATTG ATTATTGACT AGAGTCGATC GACAGCTGTG 5 0 
GAATGTGTGT CAGTTAGGGT GTGGAAAGTC CCCAGGCTCC CCAGCAGGCA 10 0 
GAAGTATGCA AAGCATGCAT CTCAATTAGT CAGCAACCAG GTGTGGAAAG 150 
TCCCCAGGCT CCCCAGCAGG CAGAAGTATG CAAAGCATGC ATCTCAATTA 2 00 
GTCAGCAACC ATAGTCCCGC CCCTAACTCC GCCCATCCCG CCCCTAACTC 2 50 
CGCCCAGTTC CGCCCATTCT CCGCCCCATG GCTGACTAAT TTTTTTTATT 300 
TATGCAGAGG CCGAGGCCGC CTCGGCCTCT GAGCTATTCC AGAAGTAGTG 350 
AGGAGGCTTT TTTGGAGGCC TAGGCTTTTG CAAAAAGCTA GCTTATCCGG 4 00 
CCGGGAACGG TGCATTGGAA CGCGGATTCC CCGTGCCAAG AGTGACGTAA 450 
GTACCGCCTA TAGAGCGATA AGAGGATTTT ATCCCCGCTG CCATCATGGT 50 0 
TCGACCATTG AACTGCATCG TCGCCGTGTC CCAAAATATG GGGATTGGCA 55 0 
AGAACGGAGA CCTACCCTGG CCTCCGCTCA GGAACGAGTT CAAGTACTTC 60 0 
CAAAGAATGA CCACAACCTC TTCAGTGGAA GGTAAACAGA ATCTGGTGAT 65 0 
TATGGGTAGG AAAACCTGGT TCTCCATTCC TGAGAAGAAT CGACCTTTAA 7 00 
AGGACAGAAT TAATATAGTT CTCAGTAGAG AACTCAAAGA ACCACCACGA ^50 
GGAGCTCATT TTCTTGCCAA AAGTTTGGAT GATGCCTTAA GACTTATTGA 80 0 
ACAACCGGAA TTGGCAAGTA AAGTAGACAT GGTTTGGATA GTCGGAGGCA 8 50 
GTTCTGTTTA CCAGGAAGCC ATGAATCAAC CAGGCCACCT TAGACTCTTT 900 
GTGACAAGGA TCATGCAGGA ATTTGAAAGT GACACGTTTT TCCCAGAAAT 95 0 
TGATTTGGGG AAATATAAAC CTCTCCCAGA ATACCCAGGC GTCCTCTCTG 10 0 0 
AGGTCCAGGA GGAAAAAGGC AT C AAG TATA AGTTTGAAGT CTACGAGAAG 10 5 0 
AAAGACTAAC AGGAAGATGC TTTCAAGTTC TCTGCTCCCC TCCTAAAGCT 1100 
ATGCATTTTT ATAAGACCAT GGGACTTTTG CTGGCTTTAG ATCCCCTTGG 1150 
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CTTCGTTAGA ACGCAGCTAC AATTAATACA TAACCTTATG TATCATACAC 12 0 0 

ATACGATTTA GGTGACACTA TAGATAACAT CCACTTTGCC TTTCTCTCCA 12 5 0 

CAGGTGTCCA CTCCCAGGTC CAACTGCACC TCGGTTCTAT CGATTGAATT 13 0 0 

CCACCATGGG ATGGTCATGT ATCATCCTTT TTCTAGTAGC AACTGCAACT 1350 

GGAGTACATT CAGAAGTTCA GCTAGTGCAG TCTGGCGGTG GCCTGGTGCA 14 0 0 

GCCAGGGGGC TCACTCCGTT TGTCCTGTGC AGCTTCTGGC TACTCCTTCT 14 5 0 

CGAGTCACTA TATGCACTGG GTCCGTCAGG CCCCGGGTAA GGGCCTGGAA 15 0 0 

TGGGTTGGAT ATATTGATCC TTCCAATGGT GAAACTACGT ATAATCAAAA 15 5 0 

GTTCAAGGGC CGTTTCACTT TATCTCGCGA CAACTCCAAA AACACAGCAT 16 0 0 

ACCTGCAGAT GAACAGCCTG CGTGCTGAGG ACACTGCCGT CTATTACTGT 16 50 

GCAAGAGGGG ATT AT C G C T A CAATGGTGAC TGGTTCTTCG ACGTCTGGGG 17 0 0 

TCAAGGAACC CTGGTCACCG TCTCCTCGGC CTCCACCAAG GGCCCATCGG 17 5 0 

TCTTCCCCCT GGCACGCTCC TCCAAGAGCA CCTCTGGGGG CACAGCGGCC 18 0 0 

CTGGGCTGCC TGGTCAAGGA CTACTTCCCC GAACCGGTGA CGGTGTCGTG 18 5 0 

GAACTCAGGC GCCCTGACCA GCGGCGTGCA CACCTTCCCG GCTGTCCTAC 19 0 0 

AGTCCTCAGG ACTCTACTCC CTCAGCAGCG TGGTGACTGT GCCCTCTAGC 195 0 

AGCTTGGGCA CCCAGACCTA CATCTGCAAC GTGAATCACA AGCCCAGCAA 2000 

CACCAAGGTG GACAAGAAAG TTGAGCCCAA ATCTTGTGAC AAAACTCACA 2 05 0 

CATGCCCACC GTGCCCAGCA CCTGAACTCC TGGGGGGACC GTCAGTCTTC 210 0 

CTCTTCCCCC CAAAACCCAA GGACACCCTC ATGATCTCCC GGACCCCTGA 215 0 

GGTCACATGC GTGGTGGTGG ACGTGAGCCA CGAAGACCCT GAGGTCAAGT 22 0 0 

TCAACTGGTA CGTGGACGGC GTGGAGGTGC ATAATGCCAA GACAAAGCCG 22 50 

CGGGAGGAGC AGTACAACAG CACGTACCGT GTGGTCAGCG TCCTCACCGT 2300 

CCTGCACCAG GACTGGCTGA ATGGCAAGGA GTACAAGTGC AAGGTCTCCA 2350 

ACAAAGCCCT CCCAGCCCCC ATCGAGAAAA CCATCTCCAA AGCCAAAGGG 24 00 

CAGCCCCGAG AACCACAGGT GTACACCCTG CCCCCATCCC GGGAAGAGAT 24 5 0 

GACCAAGAAC CAGGTCAGCC TGACCTGCCT GGTCAAAGGC TTCTATCCCA 2500 

GCGACATCGC CGTGGAGTGG GAGAGCAATG GGCAGCCGGA GAACAACTAC 2 550 

AAGACCACGC CTCCCGTGCT GGACTCCGAC GGCTCCTTCT TCCTCTACAG 2600 
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CAAGCTCACC GTGGACAAGA GCAGGTGGCA 
GCTCCGTGAT GCATGAGGCT CTGCACAACC 
TCCCTGTCTC CGGGTAAATG AGTGCGACGG 
AGCTTGGCCG CCATGGCCCA ACTTGTTTAT 
AATAAAGCAA TAGCATCACA AATTTCACAA 
CATTCTAGTT GTGGTTTGTC CAAACTCATC 
GATCGATCGG GAATTAATTC GGCGCAGCAC 
GAAAGAGGAA CTTGGTTAGG TACCTTCTGA 
GAATGTGTGT CAGTTAGGGT GTGGAAAGTC 
GAAGTATGCA AAGCATGCAT CTCAATTAGT 
TCCCCAGGCT CCCCAGCAGG CAGAAGTATG 
GTCAGCAACC ATAGTCCCGC CCCTAACTCC 
CGCCCAGTTC CGCCCATTCT CCGCCCCATG 
TATGCAGAGG CCGAGGCCGC CTCGGCCTCT 
AGGAGGCTTT TTTGGAGGCC TAGGCTTTTG 
CCGGGAACGG TGCATTGGAA CGCGGATTCC 
GTACCGCCTA TAGAGTCTAT AGGCCCACCC 
GGCTACAATT AATACATAAC CTTTTGGATC 
CCACTTTTTC TTTTTCTCCA CAGGTGTCCA 
TCGGTTCGCG AAGCTAGCTT GGGCTGCATC 
TGGTCATGTA TCATCCTTTT T C TAG TAG C A 
AG AT AT C C AG ATGACCCAGT CCCCGAGCTC 
ATAGGGTCAC CATCACCTGC AGGTCAAGTC 
GGTGCTACGT ATTTACACTG GTATCAACAG 
ACTACTGATT TACAAAGTAT CCAATCGATT 
TCTCTGGATC CGGTTCTGGG ACGGATTTCA 
CAGCCAGAAG ACTTCGCAAC TTATTACTGT 
GCTCACGTTT GGACAGGGTA CCAAGGTGGA 



GCAGGGGAAC GTCTTCTCAT 2 65 0 
ACTACACGCA GAAGAGCCTC 27 00 
CCCTAGAGTC GACCTGCAGA 2750 
TGCAGCTTAT AATGGTTACA 28 0 0 
ATAAAGCATT TTTTTCACTG 28 5 0 
AATGTATCTT ATCATGTCTG 2 90 0 
CATGGCCTGA AATAACCTCT 2 95 0 
GGCGGAAAGA ACCATCTGTG 3000 
CCCAGGCTCC CCAGCAGGCA 3 05 0 
CAGCAACCAG GTGTGGAAAG 310 0 
CAAAGCATGC ATCTCAATTA 315 0 
GCCCATCCCG CCCCTAACTC 32 0 0 
GCTGACTAAT TTTTTTTATT 32 50 
GAGCTATTCC AGAAGTAGTG 33 00 
CAAAAAGCTA GCTTATCCGG 3 3 50 
CCGTGCCAAG AGTCAGGTAA 34 0 0 
CCTTGGCTTC GTTAGAACGC 34 50 
GATCCTACTG ACACTGACAT 3 500 
CTCCCAGGTC CAACTGCACC 3 550 
GATTGAATTC CACCATGGGA 360 0 
ACTGCAACTG GAGTACATTC 3 65 0 
CCTGTCCGCC TCTGTGGGCG 3 7 00 
AAAGCTTAGT ACATGGTATA 3 7 50 
AAACCAGGAA AAGCTCCGAA 3800 
CTCTGGAGTC CCTTCTCGCT 38 5 0 
CTCTGACCAT CAGCAGTCTG 3 90 0 
TCACAGAGTA CTCATGTCCC 3 950 
GATCAAACGA ACTGTGGCTG 4 00 0 
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CACCATCTGT CTTCATCTTC CCGCCATCTG 
ACTGCTTCTG TTGTGTGCCT GCTGAATAAC 
AGTACAGTGG AAGGTGGATA ACGCCCTCCA 
GTGTCACAGA GCAGGACAGC AAGGACAGCA 
CTGACGCTGA GCAAAGCAGA CT AC GAG AAA 
AGTCACCCAT CAGGGCCTGA GCTCGCCCGT 
GAGAGTGTTA AGCTTGGCCG CCATGGCCCA 
AATGGTTACA AATAAAGCAA TAGCATCACA 
TTTTTCACTG CATTCTAGTT GTGGTTTGTC 
ATCATGTCTG GATCGATCGG GAATTAATTC 
AATAACCTCT GAAAGAGGAA CTTGGTTAGG 
ACCAGCTGTG GAATGTGTGT CAGTTAGGGT 
CCAGCAGGCA GAAGTATGCA AAGCATGCAT 
GTGTGGAAAG TCCCCAGGCT CCCCAGCAGG 
ATCTCAATTA GTCAGCAACC ATAGTCCCGC 
CCCCTAACTC CGCCCAGTTC CGCCCATTCT 
TTTTTTTATT TATGCAGAGG CCGAGGCCGC 
AGAAGTAGTG AGGAGGCTTT TTTGGAGGCC 
TTACCTCGAG CGGCCGCTTA ATTAAGGCGC 
AACAGCTTGG CACTGGCCGT CGTTTTACAA 
TGGCGTTACC CAACTTAATC GCCTTGCAGC 
GGCGTAATAG CGAAGAGGCC CGCACCGATC 
AGCCTGAATG GCGAATGGCG CCTGATGCGG 
GTGCGGTATT TCACACCGCA TACGTCAAAG 
TAGCGGCGCA TTAAGCGCGG CGGGTGTGGT 
CTACACTTGC CAGCGCCCTA GCGCCCGCTC 
TTTCTCGCCA CGTTCGCCGG CTTTCCCCGT 
CCCTTTAGGG TTCCGATTTA GTGCTTTACG 
TTGATTTGGG TGATGGTTCA CGTAGTGGGC 



i 

ATGAGCAGTT GAAATCTGGA 4 050 

TTCTATCCCA GAGAGGCCAA 410 0 

ATCGGGTAAC TCCCAGGAGA 415 0 

CCTACAGCCT CAGCAGCACC 42 00 

CACAAAGTCT ACGCCTGCGA 4 2 50 

CACAAAGAGC TTCAACAGGG 4 3 00 

ACTTGTTTAT TGCAGCTTAT 4 350 

AATTTCACAA ATAAAGCATT 44 0 0 

CAAACTCATC AATGTATCTT 4 4 50 

GGCGCAGCAC CATGGCCTGA 4 500 

TACCTTCTGA GGCGGAAAGA 4 5 50 

GTGGAAAGTC CCCAGGCTCC 4 60 0 

CTCAATTAGT CAGCAACCAG 4 6 50 

CAGAAGTATG CAAAGCATGC 4700 

CCCTAACTCC GCCCATCCCG 4 7 50 

CCGCCCCATG GCTGACTAAT 4 8 00 

CTCGGCCTCT GAGCTATTCC 4 850 

TAGGCTTTTG CAAAAAGCTG 4 90 0 

GCCATTTAAA TCCTGCAGGT 4 95 0 

CGTCGTGACT GGGAAAACCC 5000 

ACATCCCCCC TTCGCCAGCT 5 0 5 0 

GCCCTTCCCA ACAGTTGCGT 5100 

TATTTTCTCC TTACGCATCT 515 0 

CAACCATAGT ACGCGCCCTG 52 0 0 

GGTTACGCGC AGCGTGACCG 52 5 0 

CTTTCGCTTT CTTCCCTTCC 53 0 0 

CAAGCTCTAA ATCGGGGGCT 53 50 

GCACCTCGAC CCCAAAAAAC 54 0 0 

CATCGCCCTG ATAGACGGTT 54 5 0 
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TTTCGCCCTT TGACGTTGGA GTCCACGTTC 
CCAAACTGGA ACAACACTCA ACCCTATCTC 
AAGGGATTTT GCCGATTTCG GCCTATTGGT 
CAAAAATTTA ACGCGAATTT TAACAAAATA 
GTGCACTCTC AGTACAATCT GCTCTGATGC 
GCTATCGCTA CGTGACTGGG TCATGGCTGC 
CCGCTGACGC GCCCTGACGG GCTTGTCTGC 
CAAGCTGTGA CCGTCTCCGG GAGCTGCATG 
ATCACCGAAA CGCGCGAGGC AGTATTCTTG 
TACGCCTATT TTTATAGGTT AATGTCATGA 
TCAGGTGGCA CTTTTCGGGG AAATGTGCGC 
TTTCTAAATA CATTCAAATA TGTATCCGCT 
AAATGCTTCA ATAATATTGA AAAAGGAAGA 
CGTGTCGCCC TTATTCCCTT TTTTGCGGCA 
TCACCCAGAA ACGCTGGTGA AAGTAAAAGA 
CACGAGTGGG TTACATCGAA CTGGATCTCA 
AGTTTTCGCC CCGAAGAACG TTTTCCAATG 
GCTATGTGGC GCGGTATTAT CCCGTGATGA 
GTCGCCGCAT ACACTATTCT CAGAATGACT 
ACAGAAAAGC ATCTTACGGA TGGCATGACA 
TGCCATAACC ATGAGTGATA ACACTGCGGC 
TCGGAGGACC GAAGGAGCTA ACCGCTTTTT 
GTAACTCGCC TTGATCGTTG GGAAC CGGAG 
CGACGAGCGT GACACCACGA TGCCAGCAGC 
AACTATTAAC TGGCGAACTA CTTACTCTAG 
GACTGGATGG AGGCGGATAA AGTTGCAGGA 
TCCGGCTGGC TGGTTTATTG CTGATAAATC 
CTCGCGGTAT CATTGCAGCA CTGGGGCCAG 



TTTAATAGTG GACTCTTGTT 550 0 
GGGCTATTCT TTTGATTTAT 555 0 
TAAAAAATGA GCTGATTTAA 56 00 
TTAACGTTTA CAATTTTATG 56 50 
CGCATAGTTA AGCCAACTCC 57 00 
GCCCCGACAC CCGCCAACAC 5750 
TCCCGGCATC CGCTTACAGA 580 0 
TGTCAGAGGT TTTCACCGTC 5850 
AAGACGAAAG GGCCTCGTGA 5 900 
TAATAATGGT TTCTTAGACG 5 95 0 
GGAACCCCTA TTTGTTTATT 6000 
CATGAGACAA TAACCCTGAT 6050 
GTATGAGTAT TCAACATTTC 6 10 0 
TTTTGCCTTC CTGTTTTTGC 615 0 
TGCTGAAGAT CAGTTGGGTG 62 0 0 
ACAGCGGTAA GATCCTTGAG 62 5 0 
ATGAGCACTT TTAAAGTTCT 63 0 0 
CGCCGGGCAA GAGCAACTCG 6 3 50 
TGGTTGAGTA CTCACCAGTC 64 0 0 
GTAAGAGAAT TATGCAGTGC 64 5 0 
CAACTTACTT CTGACAACGA 6 50 0 
TGCACAACAT GGGGGATCAT 6 55 0 
CTGAATGAAG CCATACCAAA 66 0 0 
AATGGCAACA ACGTTGCGCA 66 5 0 
CTTCCCGGCA ACAATTAATA 6^0 0 
CCACTTCTGC GCTCGGCCCT 67 5 0 
TGGAGCCGGT GAGCGTGGGT 68 0 0 
ATGGTAAGCC CTCCCGTATC 68 5 0 
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GTAGTTATCT ACACGACGGG GAGTCAGGCA ACTATGGATG AACGAAATAG 6 90 0 

ACAGATCGCT GAGATAGGTG CCTCACTGAT TAAGCATTGG TAACTGTCAG 6 950 

ACCAAGTTTA CTCATATATA CTTTAGATTG ATTTAAAACT TCATTTTTAA 7000 

TTTAAAAGGA TCTAGGTGAA GATCCTTTTT GATAATCTCA TGACCAAAAT 7 050 

CCCTTAACGT GAGTTTTCGT TCCACTGAGC GTCAGACCCC GTAGAAAAGA 7 10 0 

TCAAAGGATC TTCTTGAGAT CCTTTTTTTC TGCGCGTAAT CTGCTGCTTG 715 0 

CAAACAAAAA AACCACCGCT ACCAGCGGTG GTTTGTTTGC CGGATCAAGA 72 00 

GCTACCAACT CTTTTTCCGA AGGTAACTGG CTTCAGCAGA GCGCAGATAC 72 5 0 

CAAATACTGT CCTTCTAGTG TAGCCGTAGT TAGGCCACCA CTTCAAGAAC 73 00 

TCTGTAGCAC CGCCTACATA CCTCGCTCTG CTAATCCTGT TACCAGTGGC 73 50 

TGCTGCCAGT GGCGATAAGT CGTGTCTTAC CGGGTTGGAC TCAAGACGAT 74 0 0 

AGTTACCGGA TAAGGCGCAG CGGTCGGGCT GAACGGGGGG TTCGTGCACA 74 5 0 

CAGCCCAGCT TGGAGCGAAC GACCTACACC GAACTGAGAT ACCTACAGCG 750 0 

TGAGCATTGA GAAAGCGCCA CGCTTCCCGA AGGGAGAAAG GCGGACAGGT 7 55 0 

ATCCGGTAAG CGGCAGGGTC GGAACAGGAG AG C G C AC GAG GGAGCTTCCA 76 0 0 

GGGGGAAACG CCTGGTATCT TTATAGTCCT GTCGGGTTTC GCCACCTCTG 76 5 0 

ACTTGAGCGT CGATTTTTGT GATGCTCGTC AGGGGGGCGG AGCCTATGGA 77 0 0 

AAAACGCCAG CAACGCGGCC TTTTTACGGT TCCTGGCCTT TTGCTGGCCT 77 5 0 

TTTGCTCACA TGTTCTTTCC TGCGTTATCC CCTGATTCTG TGGATAACCG 78 0 0 

TATTACCGCC TTTGAGTGAG CTGATACCGC TCGCCGCAGC CGAACGACCG 785 0 

AGCGCAGCGA GTCAGTGAGC GAGGAAGCGG AAGAGCGCCC AATACGCAAA 7 90 0 

CCGCCTCTCC CCGCGCGTTG GCCGATTCAT TAATCCAGCT GGCACGACAG 7 95 0 

GTTTCCCGAC TGGAAAGCGG GCAGTGAGCG CAACGCAATT AATGTGAGTT 8 000 

ACCTCACTCA TTAGGCACCC CAGGCTTTAC ACTTTATGCT TCCGGCTCGT 8 0 50 

ATGTTGTGTG GAATTGTGAG CGGATAACAA TTTCACACAG GAAACAGCTA 810 0 
TGACCATGAT TACGAATTAA 812 0 
(2) INFORMATION FOR SEQ ID NO : 6 9 : 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 800 base pairs 

(B) TYPE: Nucleic Acid 



$ : 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

AAAAGGGTAT CTAGAGGTTG AGGTGATTTT ATGAAAAAGA ATATCGCATT 5 0 
TCTTCTTGCA TCTATGTTCG TTTTTTCTAT TGCTACAAAC GCGTACGCTG 10 0 
AGGTTCAGCT AGTGCAGTCT GGCGGTGGCC TGGTGCAGCC AGGGGGCTCA 15 0 
CTCCGTTTGT CCTGTGCAGC TTCTGGCTAC TCCTTCTCGA GTCACTATAT 2 00 
GCACTGGGTC CGTCAGGCCC CGGGTAAGGG CCTGGAATGG GTTGGATATA 250 
TTGATCCTTC CAATGGTGAA ACTACGTATA ATCAAAAGTT CAAGGGCCGT 300 
TTCACTTTAT CTCGCGACAA CTCCAAAAAC ACAGCATACC TGCAGATGAA 350 
CAGCCTGCGT GCTGAGGACA CTGCCGTCTA TTACTGTGCA AGAGGGGATT 4 00 
ATCGCTACAA TGGTGACTGG TTCTTCGACG TCTGGGGTCA AGGAACCCTG 4 50 
GTCACCGTCT CCTCGGCCTC CACCAAGGGC CCATCGGTCT TCCCCCTGGC 50 0 
ACCCTCCTCC AAGAGCACCT CTGGGGGCAC AGCGGCCCTG GGCTGCCTGG 55 0 
TCAAGGACTA CTTCCCCGAA CCGGTGACGG TGTCGTGGAA CTCAGGCGCC 60 0 
CTGACCAGCG GCGTGCACAC CTTCCCGGCT GTCCTACAGT CCTCAGGACT 65 0 
CTACTCCCTC AGCAGCGTGG TGACCGTGCC CTCCAGCAGC TTGGGCACCC 7 00 
AGACCTACAT CTGCAACGTG AATCACAAGC CCAGCAACAC CAAGGTCGAC 750 
AAGAAAGTTG AGCCCAAATC TTGTGACAAA ACTCACACAT GCCCGCCTGA 8 00 
(2) INFORMATION FOR SEQ ID NO : 7 0 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 256 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
15 10 15 

Ser lie Ala Thr Asn Ala Tyr Ala Glu Val Gin Leu Val Gin Ser 

20 25 30 

Gly Gly Gly Leu Val Gin Pro Gly Gly Ser Leu Arg Leu Ser Cys 
35 40 45 

Ala Ala Ser Gly Tyr Ser Phe Ser Ser His Tyr Met His Trp Val 



t : 

50 55 60 

Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val Gly Tyr lie Asp 

65 70 75 

Pro Ser Asn Gly Glu Thr Thr Tyr Asn Gin Lys Phe Lys Gly Arg 

80 85 90 

Phe Thr Leu Ser Arg Asp Asn Ser Lys Asn Thr Ala Tyr Leu Gin 

95 100 105 

Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala 

110 115 120 

Arg Gly Asp Tyr Arg Tyr Asn Gly Asp Trp Phe Phe Asp Val Trp 

125 130 135 

Gly Gin Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly 

140 145 150 

Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly 

155 160 165 

Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu 

170 175 180 

Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val 

185 190 195 

His Thr Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser Leu 

200 205 210 

Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gin Thr 

215 220 225 

Tyr lie Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp 

230 235 240 

Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro 

245 250 255 

Pro 



(2) INFORMATION FOR SEQ ID NO : 7 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 452 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 1 : 

Glu Val Gin Leu Val Gin Ser Gly Gly Gly Leu Val Gin Pro Gly 
15 10 15 

Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Tyr Ser Phe Ser 
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Ser His Tyr Met Hrs Trp Val Arg Gin Ala Pro Gly Lys Gly Leu 

3 5 4 0 4 5 

Glu Trp Val Gly Tyr lie Asp Pro Ser Asn Gly Glu Thr Thr Tyr 

5 0 5 5 6 0 

Asn Gin Lys Phe Lys Gly Arg Phe Thr Leu Ser Arg Asp Asn Ser 

6 5 7 0 7 5 

Lys Asn Thr Ala Tyr Leu Gin Met Asn Ser Leu Arg Ala Glu Asp 

SO 85 90 

Thr Ala Val Tyr Tyr Cys Ala Arg Gly Asp Tyr Arg Tyr Asn Gly 

95 100 105 

Asp Trp Phe Phe Asp Val Trp Gly Gin Gly Thr Leu Val Thr Val 

110 115 120 

Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro 

125 130 135 

Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys Leu 

140 145 150 

Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser 

155 160 165 

Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val Leu Gin 

170 175 180 

Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser 

185 190 195 

Ser Ser Leu Gly Thr Gin Thr Tyr lie Cys Asn Val Asn His Lys 

200 205 210 

Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys 

215 220 225 

Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu 

230 235 240 

Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr 

245 250 255 

Leu Met lie Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp 

260 265 270 

Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp 

2-75 280 285 

Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gin 

290 295 3 00 



Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His 



290 



« * 



$ : 

305 310 315 

Gin Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn 

320 325 330 

Lys Ala Leu Pro Ala Pro lie Glu Lys Thr lie Ser Lys Ala Lys 

335 340 345 

Gly Gin Pro Arg Glu Pro Gin Val Tyr Thr Leu Pro Pro Ser Arg 

350 355 360 

Glu Glu Met Thr Lys Asn Gin Val Ser Leu Thr Cys Leu Val Lys 

365 370 375 

Gly Phe Tyr Pro Ser Asp lie Ala Val Glu Trp Glu Ser Asn Gly 

380 38 5 3 90 

Gin Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser 

395 400 405 

Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser 

410 415 420 

Arg Trp Gin Gin Gly Asn Val Phe Ser Cys Ser Val Met His Glu 

425 430 435 

Ala Leu His Asn His Tyr Thr Gin Lys Ser Leu Ser Leu Ser Pro 

440 445 450 

Gly Lys 



(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Asp lie Gin Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val 

15 10 15 

Gly Asp Arg Val Thr lie Thr Cys Arg Ser Ser Gin Ser Leu Val 

20 25 30 

His Gly lie Gly Ala Thr Tyr Leu His Trp Tyr Gin Gin Lys Pro 

3 5 4 0 4 5 

Gly Lys Ala Pro Lys Leu Leu lie Tyr Lys Val Ser Asn Arg Phe 

50 55 60 

Ser Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp 

65 70 75 

Phe Thr Leu Thr lie Ser Ser Leu Gin Pro Glu Asp Phe Ala Thr 
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80 85 90 

Tyr Tyr Cys Ser Gin Ser Thr His Val Pro Leu Thr Phe Gly Gin 

95 100 105 

Gly Thr Lys Val Glu lie Lys Arg Thr Val Ala Ala Pro Ser Val 

110 115 120 

Phe lie Phe Pro Pro Ser Asp Glu Gin Leu Lys Ser Gly Thr Ala 

125 130 135 

Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg Glu Ala Lys 

140 145 150 

Val Gin Trp Lys Val Asp Asn Ala Leu Gin Ser Gly Asn Ser Gin 

155 160 165 

Glu Ser Val Thr Glu Gin Asp Ser Lys Asp Ser Thr Tyr Ser Leu 

170 175 180 

Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys His Lys 

185 190 195 

Val Tyr Ala Cys Glu Val Thr His Gin Gly Leu Ser Ser Pro Val 

200 205 210 



Thr Lys Ser Phe Asn Arg Gly Glu Cys 
215 
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